rank | frequency | n-gram |
---|---|---|
1 | 51014 | -a |
2 | 48182 | -n |
3 | 29306 | -i |
4 | 16502 | -s |
5 | 14922 | -r |
rank | frequency | n-gram |
---|---|---|
1 | 33203 | -an |
2 | 19430 | -ya |
3 | 13286 | -ng |
4 | 11553 | -,” |
5 | 7347 | -ah |
rank | frequency | n-gram |
---|---|---|
1 | 17195 | -nya |
2 | 10084 | -kan |
3 | 5141 | -ang |
4 | 3682 | -ing |
5 | 3452 | -asi |
rank | frequency | n-gram |
---|---|---|
1 | 4554 | -nnya |
2 | 2431 | -ngan |
3 | 1956 | -inya |
4 | 1942 | -akan |
5 | 1897 | -anya |
rank | frequency | n-gram |
---|---|---|
1 | 3885 | -annya |
2 | 1209 | -nya,” |
3 | 1134 | -angan |
4 | 1007 | -/2019 |
5 | 789 | -nesia |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings